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Abstract. 

This paper explores the reconstruction of drift or diffusion coefficients of a 
scalar stochastic diffusion processes as it starts from an initial value and reaches, 
for the first time, a threshold value. We show that the distribution function 
derived from repeated measurements of the first-exit times can be used to formally 
partially reconstruct the dynamics of the process. Upon mapping the relevant 
stochastic differential equations (SDE) to the associated Sturm-Liouville problem, 
results from Gclfand and Levitan 1101 can be used to reconstruct the potential of 
the Schrodinger equation, which is related to the drift and diffusion functionals 
of the SDE. We show that either the drift or the diffusion term of the stochastic 
equation can be uniquely reconstructed, but only if both the drift and diffusion 
are known in at least half of the domain. No other information can be uniquely 
reconstructed unless additional measurements are provided. Applications and 
implementations of our results are discussed. 



1. Introduction 

Consider a diffusive stochastic process X t , with an initial condition Xq. Once X t 
reaches a certain threshold value X* , the process is stopped and restarted at X . The 
distribution of times for X t to first reach X* across the ensemble of measurements, 
the first-exit time distribution, arises in many physical applications Mean first- 

exit times are readily calculated from explicit forms for the dynamical rules controlling 
the process X t . However, in many applications, the underlying physics is unknown, 
complex, and/or very difficult to model. In such instances, the question arises as 
to whether one can reconstruct the form of the governing equations or equation 
parameters that best fit measured data (such as first-exit time distributions). In 
this paper, we analyse a general scalar diffusive process to formally determine how 
much of the drift and diffusion functions can be determined from first-exit time data. 

Many first passage time problems can be recast as inverse problems. We describe 
two physical realizations that motivate this work: transmembrane voltage spike 
frequencies in neurons [111 13 lj , and first passage times over a molecular barrier in 
the rupture of chemical bonds |5J . The transmembrane voltage of a nerve cell is 
routinely measured and typically exhibits repeated, sharp spikes. A class of reduced 
models for the transmembrane voltage V is described by 



^l=I(V t ) + g(V t )r, t , 



(1) 
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where I(V t ) and g(V t ) are typically polynomial functions of V, and rj t is a noise 
arising from interactions with the many other connected neurons. This Langevin- 
type equation is to be appropriately interpreted with Stratonovich calculus [15). An 
additional, instantaneous nonlinearity that gives rise to a spike and voltage reset 
is implicitly included by imposing a threshold voltage V* . This nonlinearity arises 
ultimately from the nonlinear dynamics of ion channels that span the cell membrane 
|11| . Upon reaching V* , the system instantaneously spikes and resets to the value 
Vq. Various forms of I(Vt) and g(Vt) have found wide use. For example, when the 
noise r\ t arising from other connected nerve cells changes the nerve cell's membrane 
conductivity, g(Vt) <x Vt + const. Both linear (linear integrate-and-fire) and quadratic 
(quadratic integrate-and-fire) forms are used to model I(V t ) ^21^, although it is 
often unclear which model of I(Vt) is most appropriate [21]. Instead of solving for 
the spike times for presupposed I(V t ) and g(V t ) [231, our goal is to infer as much as 
possible about the nonlinear current- voltage relationship I(Vt) and coupling to noisy 
inputs g(Vt), given a measured spike time distribution |29| . 

Another application where first exit times can be used to reconstruct models is in 
the single molecule measurements of macromolccular detachment, or bond rupturing. 
An example of this rapidly developing field is the accelerated rupturing of bonds 
upon application of a dynamic load between two macromoleculcs, such as the ligand- 
receptor complex biotin-streptavidin [201 177] . The mean times to bond breaking have 
been measured (20) and modelled [27J . This "Kramers problem" 1161 12^| , can also 
be studied as an inverse problem. Bond rupture can be modelled as the irreversible 
passage of the relevant molecular coordinate X over a barrier from a metastable 
state. Thermal bombardments agitate both the molecular distance coordinate directly 
and the interaction potential. In the ovcrdamped, or Markovian limit, the distance 
coordinate obeys the Langcvin equation 

^± = -U'(X t ) + g(X t ) Vt , (2) 

where U(X t ) is the effective interaction potential of the rupturing bond, and g(X t ) 
represents a force that arises from a fluctuating potential driven by the white diffusion 
noise 7y t (again, the appropriate convention is the Stratonovich difference rule). The 
forward problem of bond rupture, or detachment, using simple fixed potentials U(X t ) 
OEIj an d with explicit forms for fluctuating potentials [7], have been well-studied. 

Both problems discussed above (see Fig. QJ, as well as many others, can be 
modeled using a scalar stochastic process where the first-exit time distribution obeys 
the backward Kolmogorov equation. We show in the following sections, that the 
measured first-exit time distribution (given a fixed initial condition) along with 
knowledge of the drift and diffusion parameters in part of the domain, is sufficient 
to uniquely determine one of them in the entire domain provided the other one is 
known. Our analysis gives a formal procedure for reconstructing drift or diffusion of 
the stochastic process. It is based on transforming the equation into a Schrodinger type 
equation and on analytical (more precisely meromorphic) continuation of measured 
data to obtain the spectral data of the Schrodinger equation. Although uniqueness is 
demonstrated in the aforementioned framework, the necessary analytical continuation 
in the reconstruction shows that the inverse problem is severely ill-posed [SJ. 
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Figure 1. (a) The transmembrane potential is instantaneously reset to Vb as 
soon as it crosses V*. We demonstrate that the functional form of the voltage 
dynamics can be be determined only between Vb and V* (the unshaded region), 
(b) Passage over a metastable barrier. The first-passage, bond-breaking times 
can be used to construct the potential U(X) and and how it fluctuates (dashed 
curve) . 

2. Stochastic Equations and Main Result 

Consider the general stochastic process 

dX t = fx(X t )dt + a(X t )dWt, (3) 

where dWt — r) t dt is the Wiener measure, and the ltd interpretation is used |15l I22j . 
The drift term, fi(X), is assumed to have sub-linear growth at infinity, and is of class 
C 1 (R). We also assume that the variance, a(X), is uniformly bounded from above and 
below by positive constants, and is of class C 2 (K). These assumptions are sufficient 
to ensure that the process (j3J) admits a unique solution [22| . 

In the application to spiking action potentials in neurons, Xt = Vt is the 
transmembrane voltage at time t. The potential immediately after reset is denoted 
Xq = Vq. The corresponding drift and diffusion terms are defined as /j,(X) = 
I(X) + ^g(X)g'(X) and a(X) = g(X), respectively, since (0 is in the Ito form and Q 
should be interpreted in the Stratonovich sense [22] ■ For a more detailed derivation 
of the the diffusion process in the context of action potential modeling, we refer the 
reader to number of standard works [111 1141 1171 l3*T] . In the application to chemical 
bond rupturing, X denotes the reaction coordinate along an effective molecular energy 
landscape (cf. Fig. 1(b)), Xq is an initial bond displacement, and we identify the drift 
in © with -U'(X) + \g{X)g'{X), 

Let us now consider the interval A = (X_, X*) and the first exit time t* of the 
process X t ; i.e., t* is the first time of exit of X t out of [1E1E2- First 

exit time measurements can then be used to reconstruct properties of the diffusion 
process. For instance, it is known that knowledge of Px{Xf = X*) (the probability 
that X t exits A at X* conditional on it starting at X £ A) and Ej[f ] (the expected 
time it takes for X t to exit A starting at X), uniquely determines \i and a. We refer 
to Breiman ^ for details on the so-called natural time scale and speed measure of 
diffusion processes. 
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In the applications presented here, the situation is different: We have access to 
the full distribution of the first-exit times, but only for processes starting at X = Xq 
(the reset potential or the initial relative molecular coordinate). The distribution of 
the exit times can be conveniently written as the solution to a partial differential 
equation (the backward Kolmogorov equation) [SJ El El El EHl EI] ■ Let us denote 
by w(X; A) the Laplace transform (from the time variable t to the dual variable A) of 
the probability density w(X;i) of the first-exit times t from I 6 A of the diffusion 
process starting at X = Xq. For simplicity, we will assume reflecting boundary 
conditions for the diffusion at X = X- , and consider termination of the process only 
at X*. Following El EI EEL w e find that w(X; A) solves the following backward 
Kolmogorov equation, 

- ^ w"(X; A) + n(X)w'(X; A) = \w(X; A), A_ < X < X* , A > 0, 

where dw(X-\\) / dX = corresponds to vanishing probability flux at X = X_. 
Although we restrict our analysis to a reflecting lower bound, depending on the 
physical application, other boundary conditions can be imposed at an arbitrary lower 
bound. The boundary condition w(X*; A) = 1 reflects the fact that w(Xq = X*, t) = 
S(t). Since w(Xq; t) is directly measured, w(Xq; A) is exactly determined for all A > 0. 
Using an affine change of variables, (X — X_)/ (X* — X_) — > x, we rescale the domain 
to x <G [0, 1] and recast Eq. Q as 

a 2 (x) 

— ^-u>"(x; A) + n(x)w'(x; A) = \w(x; A), < x < 1, A > 0, 

dw(0;\) & 
w(l;X) = l, dx =0, 

where x is now the initial position of the process. We still use the notation a and 
/j, for the diffusion and drift terms in the rescaled variables. For example, in the 
neuron spiking problem, the voltage threshold is at 1, the inhibitory threshold (the 
lower bound for the voltage) is at 0, and the reset voltage that restarts the process 
is at x = xq. This problem admits a unique solution provided a remains uniformly 
bounded from above and below and fj, remains bounded. 

In many instances /x(x) and o~(x) are known from a well-accepted model, and 
the function w(x; A) can be computed directly. Now consider the case where /J-(x) 
and o~(x) are unknown, but w(x = xq] A) is measured at a single initial position 
x = (X — X_)/(X* — XJ). What can one reconstruct (with respect to fj,(x) and 
o~(x)) from a knowledge of w(xq; A) for all A > 0? In answering this question, we arrive 
at the following: 

Let < Xb < 1 be the point defined in (|18|l below and let A C (0, 1) be a countable 
set of Lebesgue measure in (0, 1), defined in iL'MI) below. We obtained the following 
related results: 

Theorem 1 Assume that fi(x) and o~(x) are known on (0, Xb) and that the 
measurement point x$ € (0, 1)\A. //, in addition, either [i(x), o~(x), or fi(x)/o~(x) 
is known entirely on [0,1] , then both fj,(x) and o~(x) are uniquely determined on [0,1] 
by measurement of w(xq; A), for all A > 0. 

Theorem 2 Assume that fJ,(x) and a~(x) are known on (0,xq) and that the 
measurement point xq > Xb- If, in addition, either (i(x), cr(x), or fi(x) /cr(x) is known 
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on [0,1], then both fJ,(x) and o~(x) are uniquely determined on [0,1] by measurement 
of w(xq;X), for all A > 0. 

Both theorems say that if either fi or a is known on [0, 1], while the other is known 
only on half of the domain, we can reconstruct this other function also on the entire 
domain. Since Xb is implicitly defined by a (cf. Eq. I18|) , it is an a priori constraint on 
the size of the domain that one can reconstruct from the measurements w(xq; A). In 
Theorem 1, the measurements are made at an almost arbitrary point xq G (0, 1). We 
have to remove a countable set of points, which is also defined implicitly as it depends 
on a and /i. This constraint is no longer necessary in Theorem 2, when xq G (xb, 1). 

The conclusions of both results hold true if the domains (0, Xb) and (0, Xq) of a 
priori knowledge of the coefficients arc replaced by 1) and (xo,l), respectively 
with xq < Xb in the latter case. We do not explicitly consider these cases. The 
main steps of the proof are the following. First we use a transformation similar to 
the Liouville transformation (used to obtain the canonical form of Sturm-Liouvillc 
problems [5]) to map Eq. (JSJ into a Schrodinger equation (Section^. Next, we show 
that a measurement of w(xo] A), for all A > 0, is sufficient to obtain a single eigenvalue 
spectrum of the Sturm-Liouville problem (when xq G" A in the setting of Theorem 1). 
This allows us to use results from Gelfand and Levitan ^01 to verify uniqueness of the 
reconstruction (Section and explicitly extract fj,(x) or <j(x) (Section EJ. The above 
theorems require to know the coefficients on half of the domain before one of them 
can be reconstructed on the other half. An explicit asymptotic expansion presented 
in section [5] shows that this a priori knowledge is indeed necessary to obtain a unique 
reconstruction. In section[^lwe present a result showing that the reconstruction of one 
of the coefficients fj,(x) or a(x) is unique over the whole domain (0, 1) provided that 
they are known on an arbitrarily small domain (0, n) and provided that a sequence of 
measurements w(x m ; A) is available for A > and x m , m G N, judiciously chosen. In 
this result, a minimum of M measurements is required to reconstruct fi or a provided 
that they are known a priori on an interval close to x = of size 2 _ju . Finally section 
considers possible implementations of the method. 



3. Mapping to Schrodinger form 

To recast Eq. (0 as a Schrodinger equation, we effect the following change of variables 
w(x) = (/it) o h{x). (6) 

Thus, 

w'(x)^ti(fu)'oh, w"(x) ti'{fu)' oh+(h') 2 {fu)" oh, (7) 
and Eq. 10) becomes 

2 

h'f (fu)"oh+ —h" + pLh' (fu)'oh = X(fu)oh. (8) 
To normalise the second-order term, we impose the constraint 

< x < 1, 

°(?) (9) 



h'(x) 



h(Q) = 0. 
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Since a is bounded from above and below by a positive constant, x — > h(x) is a 
diffcomorphism. Upon introducing the change of variables 

y = h(x), (10) 

the points x — 0, x — Xo, and x — Xb before the change of variables become 
j/ = h(0) = 0, yo = h(xo), and j/6 = h(xb), respectively. If we define a new drift 

2 

!/(») = (y^ + M^)^- 1 ^)), < y < y* = Ml), (11) 

Eq. ||SJ| can be succinctly expressed as 

(fu)"(y) + v(y)(fu)'(y) = \(f U )(y). (12) 
To remove the drift term, we force / to be the solution to 



2f'(y) + i>(y)f(y) = 0, 0<y<y*, 
f(vo) = 1. 

Since f{y) > for y = y , f(y) > for all y G [0, y*]. We finally obtain 



(13) 



u "(y) - v(y) u (y) = Xu iy), ° <y <y* 

1 „ <9u(0;A) „ (14) 

where the potential is defined by 

v f + f" 

d{y) = - J -j J -{y)- (15) 

By assumption, o-(x) and [i(x) are known for < x < b. This implies that v(y) 
and f(y), and hence q(y), are known for < y < yb = h(xb), although y* = h(l) 
itself is generally not known (unless o~(x) is also known on [0, 1]). This undetermined 
domain size is found in the following section. 

4. Uniqueness of the potential reconstruction 

We now consider the reconstruction of the potential q(y) in the equation 

u"(y; A) - q(y)u(y; A) = Xu(y; A), < y < y* = h(l) 

1 du(0;X) (16) 

u(yH 7(r) >0 ' ~^y-^°- 

The assumption is that q(y) is known on [0, yb] as described earlier. However y* = h(l) 
depends on o~{x) throughout the domain and is unknown, as is q{y) on (0, y*). From 
measurements, we have u(yo] A) for all A > since w(xo; A) = u(yo; A) and /(j/o) = 1- 
Uniqueness of the reconstruction of q{y) on (yb 7 y*) is based on the fact that 
A — > u(y; A) is analytic with poles corresponding to values of A that are eigenvalues 
for specific spectral equations. It is known that two spectral equations are necessary 
to reconstruct the potential q(y) from spectral data [3 El El EH- However, we will 
see that our problem allows extraction of at most only one spectrum. Nonetheless, if 
the "potential" q(y) is known for half of the entire domain, one eigenvalue spectrum 
suffices for the complete and unique reconstruction of q{y) [HlEEj- 
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Since q(y) is known for < y < h(xb), we require 
K x b) = TT = —Z- 



(17) 



in order for q(y) to be completely reconstructed on [0, y*] Therefore, the original 
functions ijl{x) and cr(x) must at least be known for < x < Xb, where 



y 



-i fh(l) 



2 J V 2 

The full problem on [0,y*] becomes 



(18) 



u"{y) - q(y)u(y) = Xu(y), < y < y* = h(l) 
1 , _ du(0; A) 



<y*) 



^0, 



dy 



= 0. 



f(y* 

u(yo;A), known. 
Now, consider the associated eigenvalue problem 



(19) 



%(v) - q{y)Mv) = -*kMv), 

6jfe(O) = fc (y*)=O. 



< y < y*, k > 1 



(20) 



If the Afe in 12U|I can be determined from our measured data u(yo;X), then q(y) on 
[0, y*] can be determined upon using classic theories described in |H1 Itfl 1101 1131 1181 1^5] . 
To find the eigenvalues to l|20() , as well as the transformed threshold y* , we decompose 
the Green's function for (|19|l in terms of the eigenfunctions of l|2()|) . and obtain 



A + \ k 



(21) 



First, consider the setting of Theorem 1. Since u(y*) = l/f(y*) > and \4>' k {y*)\ > 
(since 4>k is a non-trivial solution to (120(1 ). |u(yo;A)| — > oo as A — > — A^, provided 
4>k(yo) 7^ 0. The latter constraint is not satisfied for all choices of yo. Let us define 

A. y = {y G (0, y*), there exists fc > 1 such that 4>k(y) = 0}. (22) 

Since the number of zeros of solutions to the Sturm-Liouville problem 120|) is finite, 
the set A y is countable and therefore of Lebesgue measure on (0,y*). As long as 
our initial measurement position j/o is not part of the above countable subset of values 
where the eigenfunctions of l|2(J|l have a node, we can in principle obtain one spectrum 
by analytically continuing the data u(yo] A) in A and finding the poles. The constraint 
on j/o also amounts to saying that the eigenfunctions 4>k cannot be restricted to being 
eigenvectors of the same problem posed on the domain (0, yo) instead of (0, y*). In the 
primitive variables the countable set of forbidden measurement points is thus defined 
as 

A = h- 1 (A y ) = {x e (0,1), h(x) e A y }. (23) 

Now consider Theorem 2. We can extend u(y; A) by evenness on (—y*, y*) and 
then by periodicity on K. The resulting function is then continuous in y and we deduce 
from (|2U that 

~<t>' k (yW k (y*My*Y 



A + A fc 



(24) 
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Now since n(x) and cr(x) are known on (0, Xq) then so is q(y) on (0, yo). Since u(y$, A) 
is known, we can solve <|14[) for u(y) on (0, yo) and thus obtain the value of f^(j/o! A). 
Since either \(f>k(yo)\ > or |</4(?/o)| > (for otherwise 4>k(y) = 0), we deduce that 
l u (z/o>A)| + |^(2/o; A)| — > 00 as A — > — Aj.. Under the hypotheses of Theorem 1 and 
Theorem 2, we thus find that one can reconstruct the spectrum {Afe}fc>i in 1|20|) from 
the measurements. 

Finally, we must determine y* . Asymptotic expansions of the eigenvalues for large 
k, found using the WKB approximation [3101123! have the form: 



A fc 



[(*-£ 




2 






y*. 


H 


y Jo 



q(y)dy + a k , (25) 



where the sequence a k £ I 2 , i.e., ||<2fc| = (SfeLi ?) 1 ^ 2 < 00 • Equation l|25|l implies 
that the threshold value y* can be evaluated using 

V* = lim -^=. (26) 
fc— >oo y Afc 

Thus, the domain size is reconstructed. The classical theories found in PlElEIlEj can 
then be directly applied to uniquely reconstruct the remaining, unknown part of q(y). 
Moreover, efficient numerical algorithms have been developed for reconstructing q(y) 
[H1EHGH]j and ultimately fi and a from the spectral data. However, the reconstruction 
of these spectral data from measurements is based on analytic continuation. This 
implies that the inverse problem is severely ill-posed |H| and that noise in the data 
is quite strongly amplified in the reconstruction; see section [7] Note that a similar 
approach was used in |23| in the context of the one-dimensional heat equation. 

5. Reconstruction of drift or diffusion term 

We showed in the preceding section that q(y) and y* are uniquely determined from 
the measurements provided that one set of hypotheses stated in Theorems 1-3 (cf. 
Section 6) are verified. Combining equations 1)13(1 and H15J) . we find 

v'(y) + y = Mvl o<y<y*- (27) 

Since both fj, and a are known on [0, xj,], 

\ °-(Xb) V2 J 

The Riccati equation l|27|) . along with the boundary condition 128|) admits a unique 
solution for v(y). 

Even if v{0 < y < y*) is known uniquely, one cannot extract both /i(x) and o~(x) 
independently However, if cither fi or a is known, then the other one can uniquely be 
reconstructed from the knowledge of v{y) on (0,y*). The simplest case occurs when 
only a is known, for then, so is h(x), and the reconstruction of fi follows from the 
definition (|11|) of v. On the other hand, we can deduce from (j^J and (|llfl that 



T-jjw-TfU^- (29) 

Let us assume that [ija can be expressed as a known (smooth) function of a and 
x. This includes the cases when [i is known or when ^ija is known. Since a is a 



On the reconstruction of diffusions from first-exit time distributions 



priori known to be uniformly bounded from below, equation <|29|) provides a first- 
order integro-diffcrcntial equation that can then be solved for cr(x) on (0,1). The 
reconstruction is uniquely defined, concluding the proof of the main theorem. 

The reconstruction of q(y) on y G (0, y*) clearly depends on the choice of q(y) on 
[0, yb] since it would change the spectrum through Eq. pHjl. Therefore, a physically 
incorrect model of q on [0, yb], contaminates the reconstruction of fi(x),a(x) on 
x G (0, 1), leading to physically incorrect, albeit unique, results. To be more specific, 
consider the simple case a 2 = 2, whence h(x) = x, y* = 1, and v{x) = n{x). Provided 
|^i| 2 <C Eq. IT31 implies q(x) ~ fi'(x)/2 <C 1. We then perform an asymptotic 
expansion of the eigenfunctions <pk = tfiko + 4>ki and eigenvalues Afc = Afco + Afci, where 
4>ui <C 1 and Afci -C 1 (see for instance), and obtain 



k— ^)) > and 



(t)ko(x) = cosn(k - -^jx, 



and using the Frcdholm alternative in the perturbation 

_< ^4'l + i<t>kO = Afel^fcO + AfcO</>fcl, 

we obtain 



Am 



l{x)(j)l {x)dx 



1 

2 



(j) m {x)dx 



Using the relation cos 26 = 2 cos 2 9 — 1, we deduce from the above equation that the 
knowledge of A^i allows us to obtain 

1 f 1 

fem-i = o / <?(a ; ) cos(2m — l)7rx dx. 



2 







Therefore the knowledge of one spectrum A& gives us half of the cosine Fourier 
transform of q(x), hence of fi(x). The other half needs to be known, for instance 
by imposing that we know q(x), or equivalently n(x), on (0, 1/2). 

Although we have shown what parts of fx(x) and o~(x) can be uniquely 
reconstructed from a single first-exit time distribution, in practice, finding the 
eigenvalues Afc from data to the precision required for accurate reconstruction is 
difficult. 



6. Reconstruction with multiple measurements 

We have seen in the preceding section that either /1 or a could be reconstructed from 
the knowledge of q provided the other one is known. In this section we obtain sufficient 
conditions on the measurements so that q(x) can uniquely be reconstructed on (e, 1) 
for e > 0. We show that it is sufficient to measure u(y m ;X) at a countable sequence 
of points y m with accumulation point at y = 0. These points y m correspond to points 
the transform x m = h 1 (y m ). They are constructed as follows. 
We choose yo such that yo > y* /2 and then by induction the points y m (m > 1) 
such that y m G (y m -i/2,y m —i). We assume that y m — > as m — * 00. We denote by 
y the set of sequences {y m }meN = (yo,J/i,' ■ ■) satisfying the above constraints. We 
can then show the following result 
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Theorem 3 Let us assume that q £ L 2 (0, 1), that there is a small interval [0, s] where 
q(y) is constant and that there is a known index M such that jjm < s. The parameters 
e and Dm need not be known. Then the measurements at u(y m ; A) , 1 < m < M+l, for 
{Dm} € y and A > uniquely determine q(y) on (0, y*) as well as the point {j/m}m€N 
and y* . 

Proof. By hypothesis, we know M such that j/m < £■ Consider the value of 
u(yM', A) 7^ 0. If u(yM\ A) = 0, the condition at y — would imply that u(y; A) = on 
(0, an d, hence on (yM,V*), since then u(yM',X) = |^(t/a/;A) = 0. This however 
contradicts the fact that u(y*) = 1. Upon dividing the solution u(y; A) by u(ynj', A) ^ 
we deduce that vm{d\ A) = u(y; X)/u(jjm] A) solves 

A ) ~ Q(v) v M{y; A) = Xv M (y; A), < y < y M 

dvM(^X) (30) 

vm\Vm\ A) = 1, 5 = 0. 

ay 

We then know from the proof of Theorem [21 and the asymptotic expansion (|25|l that 
the measurement of v(yM+i) uniquely determines j/m and q(y) on (0,?/m) since the 
latter is constant. 

The rest of the proof follows by induction. Let us assume that q(y) is known on 
(0, y m +i) for m+ 1 < A/. Then u{y m ; X) ^ and w m (j/; A) = u(y; X)/u(y m ; A) satisfies 
the same equation <|30|) with yM replaced by y m . Upon using Theorem [21 we deduce 
that the knowledge of v m (y m +i;X) for A > uniquely determines y m and q(x) on 
(0, y m )- This allows us to reconstruct j/o and q(y) on (0,yo) by induction. Applying 
Theorem [21 on (0, y*) one last time concludes the proof of the theorem. □ 

The same type of proof could be used with other local a priori information 
on the behavior of q(y) in the vicinity of y = 0. For instance, we can show that 
the reconstruction is feasible provided that q(x) is known on an arbitrarily small 
interval (0, e) and provided that there exists an admissible sequence (yo, j/i, • • • , J/m) 
of measurements with yM < £■ 

In both cases the reconstruction of q{y) and of the measurement points y m allows 
us to obtain some information about the original drift and diffusion terms fJ-( x ) an d 
o~(x). We can for instance prove the following result. 

Theorem 4 Assume that fi(x) and o~(x) are known on (0, 77) for some r] > and that 
w(x m ; X) is measured for X > at x m , 1 < m < M , where the constraints on {x m } m 
are that xm < T) and {y m }ni = {h(x m )} rn G y. If, in addition, either (l(x), cr(x), 
or fj,(x)/a(x) is known on [0, 1], then both fi(x) and o~(x) are uniquely determined on 
[0,1]. 

Proof. The proof follows by induction. We define yM = }i{xm) < £ = h(rf). Let us 
now define /jvf(y) as in 1)13(1 on (0, yu-i) but with boundary condition fM(yu) = 1- 
This implies that Um{vm\X) = w{xm]X) where um(j/;A) solves the Schrodinger 
equation 

u m(v) - q(y) u M(y) = Xu M (y), o < y < y M -i 

( ^ 1 s n d^/(0;A) (31) 

U M {yM-l) = 77 r > 0, — = 0. 

JKVM-i) ay 

We have seen in Thcorcm[31that q{y) can then be uniquely reconstructed on (0, J/m-i) 
since it is known on (0, yjvf)- We can now apply the rest of the proof of Thcorcm[2]on 
the interval (0,a;.M_i). This allows us to reconstruct \i and a uniquely on (0, Xm-x)- 
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Similarly the knowledge of fi and a on (0, x m ) allows us to reconstruct both coefficients 
uniquely on (0, x m -i) and the same knowledge on (0,a;o) allows reconstruction on 
(0, 1). This concludes the proof of the theorem. □ 

7. Implementation of the reconstruction 

Theorems^!!! and0]provide an explicit method to reconstruct the drift and diffusion 
coefficients fi and a from the measurements. This method is based on three steps. The 
first step consists of finding the poles of meromorphic functions of the form A — > u(y; A) 
at fixed y from its knowledge for A > 0. In the second step, the potential q{y) is 
reconstructed from the knowledge of the spectral data provided in step 1. Finally 
and o~{x) are reconstructed from q(y) by solving first-order ordinary differential 
equations. 

Step 3 is well-posed in the sense that noise in the potential q(y) arc at most 
linearly amplified in the reconstruction of n(x) and tr(x). Step 2 is also well posed 
as the reconstruction of the potential q(jj) from the spectra is stable, for instance in 
the sense that an error of order e on the I 2 norm of in (|25(1 implies an error also of 
order e in the reconstruction of q(x) in the L 2 sense |19l I25j . 

Step 1, however, is severely ill-posed in the sense that order-e errors in the 
measured generate errors greater than order e a (for all a > 0) in the I 2 norm of 
the coefficients ak in 125fl . This is a similar result to the reconstruction of diffusion 
coefficients from boundary measurements, which is known to be severely ill-posed 
^ E|- We can show a Holder- type stability result in the reconstruction of each 
eigenvalue. This means that for each eigenvalue A^, there exists oik > such that 
errors in the data of order e yield an error of order e ah on the reconstruction. 
Unfortunately, the prefactor in front of e° k grows exponentially in k so that at most 
only approximately | log e | eigenvalues can be reconstructed from oder-e accurate 
measured data. This behavior is a consequence of the results on the meromorphic 
prolongation of functions with poles, as shown by K. Miller Since we do not 

attack the numerical reconstruction here we briefly sketch the result and how it may 
be used to derive the above stability estimates. 

Let us consider the open disk D of radius R and center in the complex plane 
and a meromorphic function /(A) = u(yo; A) inside that disk with N poles, which we 
assume lie on the segment (— R, 0). We want to reconstruct these N poles from the 
measurement of the meromorphic function on the segment T = (0, R). Let us define 
by w(z) the function that takes the value on dD, the value 1 on T, and is harmonic 
on D\T. One verifies that < w(z) < 1 on D\T. Let us recast our meromorphic 
function as /(A) = -F 1 (A)/J|^ =1 (A — A^) where F(X) is analytic on D. We assume that 



Here h(X) are our measurements. The above relation indicates what we mean by error 
in the measurements and essentially says that / — h is bounded by e in the suprcmum 
norm. Upon slightly modifying the proof of Theorem 1 in [21], we obtain that it is 
possible to reconstruct the poles Afe of / on D with an error of order 



\F - Bh\ <e\B\ 



on r, 



and 



\F\<E, 



on dD. 



\8\ k \ < 





2 
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Here we have bounded B by R N on D. In the case of interest in this paper, we have 
that 

R~N 2 , \ k ~k 2 , and W k = u{y*)<t> k {yoW k {y*). 

The radius is chosen as roughly 1/2(Ajv + Ajv+i). At the boundary dD of the domain, 
we observe that F is bounded by (using Stirling's approximation) 

E-UlXt-W-iMfKN^+'e-". 

Using these values we find that 

\6X k \ <C N s w ^ 

for some positive constant C . The above constraint is sharp [21]. This shows that the 
reconstruction of any eigenvalue is Holder continuous, where the order of continuity 
a = w(Afe). This is therefore a mildly ill-posed problem. 

However, although the reconstruction of each eigenvalue is mildly ill-posed, 
collectively the reconstruction of the whole spectrum is severely ill-posed if one wants 
an approximation in the I 2 sense. Indeed for a given noise level e, the above formula 
shows that we can only reconstruct a number of eigenvalues that is proportional 
to |loge|, for the reconstructed information becomes useless as |<5Afc| > 1 and the 
corresponding coefficients a k in ill'oll are not reconstructed at all. Now assuming that 
w(X k ) = 1, which is the best we can expect, we observe that C N e = 1 for N ~ | lne|. 
This shows that the reconstruction of the potential q(y) is severely ill-posed. Indeed 
let us assume that the asymptotic coefficient of q in (|25|l are of the form a k ~ k~ M for 
some M, The I 2 error on the Fourier coefficients obtained by truncating all coefficients 
of index greater than |lne| is then of order N~ M+1 / 2 ~ |loge| _M+1 / 2 e a for all 
a > 0. 

8. Summary and Conclusions 

We have shown that if \i and a are known in say, x £ [0,x&], with xj, = h~ 1 (y*/2) 
(Eq. I18f) . one of them can be reconstructed from a single first-exit time distribution 
measurement at xq A. The procedure is formally carried out by transforming 
the backward Kolmogorov equation for the first-exit time distribution w(x; A) into 
a Schrodinger equation in u(y; A) with a potential q{y) that is a functional of n(x) 
and o(x). If this potential is known in the domain < y < yb = h(xb), q{y) can be 
reconstructed from the single eigenvalue spectrum obtained by analytically continuing 
the solution u(y; A) (in A) and finding the poles. It is this pole-finding procedure that 
is severely ill-posed. The asymptotic limit of the eigenvalues also determine the extent 
of the transformed domain y* . From the reconstructed q(y), one can reconstruct n(x) 
or <t(x), only if the other (or /j,(x)/a(x)) is also known. Finally, we have shown that 
the measurement process and the determination of spectra can be repeated a sufficient 
number of times to obtain q(y) (Theorem 3) and fi or a (Theorem 4) on the entire 
domain provided q(y) and a or \i is initially known on a small interval (0,e). 

Our analysis in section shows that the proposed inversion is severely ill-posed 
even if the hypotheses of the aforementioned theorems are satisfied. However the 
reconstruction of partial spectral data is mildly ill-posed so we believe the proposed 
method can be implemented. Let us conclude with the following remark. Our method 
is based on reconstructing the eigenvalues Afc from the knowledge of u(xq;A). The 
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latter may however provide additional information. For instance we observe from 1)210 
that the residuals (/>k(y)<P'k(y*) u (y*) can a l so be reconstructed from the measured data. 
It is known that the measurement of one spectrum {Afc} may be sufficient to uniquely 
reconstruct q(y) on the whole interval {0,y*) provided that additional normalization 
conditions be also measured on the eigenfunctions ■ It is not clear whether 

additional information on 4>k(y)4''k(y*) u (y*) ma y be useful. Preliminary numerical 
simulations seem to show that reconstructions from a single measurement only allows 
us to reconstruct part of the potential. 
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